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1^ ' A useful property of a network that can be used to characterize many systems is the degree dis- 

tribution. However, many complex networks exhibit higher-order degree correlations that must be 
studied through other means, such as clustering coefficients, the Newman r factor, and the average 
7^ ' nearest neighbour degree (ANND). In this paper we develop an expansion of the conditional proba- 

C^H, bility that can be used to parameterize such degree correlations. The measures of degree correlations 

fj ■ associated with this expansion can be used to signal the presence of non-linear correlations. 
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I. INTRODUCTION 

Interest in the applications of networks to technological, biological, and social systems has grown significantly in 
recent years [l|-l3|. One property of networks that has received much attention is the degree distribution, which 
characterizes the number of edges that connect a given node. Many networks show a power-law form of the degree 
distribution, while other networks may have an exponential or truncated power-law distribution. However, there is a 
further property of networks that is not completely captured by the degree distribution alone: it is found that most 
networks have a high degree of clustering, with nodes tending to share many common connections. 

A number of ways have been developed to characterize such correlations. One standard approach was formulated 
by Newman f4u7l|, who introduced what is now called the Newman factor r. This number is essentially the Pearson 
correlation coefficient of degrees from connected vertices in a network and is fully defined by two-point correlations 
in a network. The Newman factor is normalized to lie in the interval — 1 < r < 1, and is defined so that positive 
(negative) values indicate that vertices with the same (different) degree tend to be connected, which indicate assortative 
(disassortative) mixing. A Newman factor of means no correlations are present. Most networks have a non-trivial 
Newman factor: biological networks tend to show negative r values, technological networks display values close to 
zero, while social networks tend to have rather large positive values [3|. Another, related measure of correlations that 
is commonly used is the average nearest neighbour degree (ANND) fc„„(A:), which is a function in principle of the 
degree k Q. For uncorrelated networks, the ANND is independent of fc, and thus an explicit k dependence indicates 
the presence of degree correlations. A A:„„(fc) that is an increasing function of k shows assortative mixing, while a 
decreasing function of k indicates disassortative mixing. A third commonly used measure of correlations are clustering 
coefficients f3| , which can be related to these other measures under certain assumptions [3, [l3| ■ 

However, some of these measures may not completely capture all correlations that may be present in a network. For 
example, the Pearson correlation coefficient by definition only explores linear relationships between two variables [ll| . 
In this paper, we develop a formalism that in principle can be used to measure higher-order non-linear correlations. 
Wc do this by examining an expansion of P(fc,A;'), the joint probability that an edge chosen at random connects 
vertices of degree k and k' . In Section [H] we develop this expansion, through which we can define generalizations of 
the Newman r factor and the ANND knn{k) which are sensitive to non-linear correlations. The expansion developed 
in this section involves certain input coefficients; in Section [IIII we show how to relate this expansion to one involving 
the specification of certain generalized ANND functions. In Section [TVl we use the results of Ref. [1(J| to show how 
these arise in defining clustering coefficients. In Section |V] we illustrate the use and effects of these generalizations in 
some simple examples. Section IVTl contains some brief conclusions. 

II. EXPANSION OF THE CONDITIONAL PROBABILITY FUNCTION 

Let P{k) denote the probability of finding a node of degree k, P{k'\k) the conditional probability that a vertex of 
degree k is connected to a vertex of degree k\ and P(fc, k') = P(fc', k) be the joint probability that a randomly chosen 
edge connects vertices of degree k and k' . These satisfy the normalization conditions 

^P(A:) = ^P(A:'|A:) = 1 (1) 

k k' 

as well as the detailed balance equation 

kP{k'\k)P{k) = k'P{k\k')P{k') (2) 

Let us now introduce the edge distribution [7[ 

Pe(fc)-^P(fc',A:) (3) 

k' 

This allows us to express the joint probability P{k' , k) to the conditional probability P{k'\k) as 

P{k',k) = P{k'\k)P,{k) (4) 

If we now define, for any function f{k), 

j(k)^Y.f^^)p^^) 

k 

(/(fc))=^/(fc)P,(fc) (5) 



we can relate the edge distribution Pe{k) to the degree distribution P{k) as 



P,(fc) = =P{k) 
k 

In terms of Pe{k), the detailed balance condition of Eq. 1^ becomes 

Pik'\k)P,ik) = P{k\k')P,ik') 



(6) 



(7) 



We now examine the form of the conditional probability P{k'\k). First note that, if the network were uncorrelated, 
we would have 



P,,,{k'\k) = ^^^^ = P,{k') 
k 

This suggests that we can put P{k'\k) into the form 

P{k'\k)^P,{k') + Q{k'\k) 
for some function Q{k'\k). The detailed balance condition of Eq. ([7]) implies 

Q{k'\k)Pe{k) ^ Q{k\k')Pe{k') ^ Q{k'\k) = Pe{k')Q{k' ,k) 
where Q{k', k) = Q{k, k') is symmetric in k and k' . With this, we can then put P{k'\k) of Eq. ^ into the form 



subject to the condition 



P{k'\k)=P,{k')[l + Q{k',k)] 

^g(fc',fc)Pe(fc')=0 



(8) 

(9) 
(10) 

(11) 

(12) 



which follows from Eq. ([IJ. 

Let us assume that we can expand the function Q{r, s) in in terms of symmetric polynomials of r and s: 



Q{r, s) = Eoir, s) + Ei{r, s) + E2{r, s) + Eair, s) + Ei{r, s) + 



(13) 



where 



Eo{r,s 
Ei{r,s 
E2{r,s 
E3ir,s 
Ei{r,s 



'^00 

:e«(r + .) 



= eil)(r4 + s^) + e^tV's + rs^) + e^'^^s^ (14) 

and e^- are constant coefficients. The constraint of Eq. p^ will lead to conditions on certain coefficients of these 
expansions to a given order. This will allow us to write the expansion of Q{m, n) to the t*^ order as 



Q(*)(r,s) = ^^ayW,(r)t;j(s) 
1=1 i=i 



r - (fc")] 



(15) 



where we have introduced 

Vnik) = - 

CTy = {{k' - {k')){y - (F)) = (F(F - (F))) = {k'+i) - (fc^)(F) (16) 

Note that (w;) = and {viVi) — 1 for i = 1, 2, . . . , n, but (viVj) ^ for i ^ j. Through the Gram-Schmidt procedure, 
we can develop an orthonormal basis Ui{k) satisfying (ui) — and (uiUj) ~ Sij from the Vi{k) as 



Ui{k) = 



Urjk) 



(17) 



where 

C/i(fc) = «i(fc) 

U2{k) = W2(fc) - {uiV2)ui{k) 

Usik) = V3{k) - {uiV3)ui{k) - {u2V3)u2{k) 

n—X 

U.n{k) = v„(fc) - ^(M,z;„)u,(fc) (18) 

The relations of Eqs. ([T7lfT8|) ahow us to express the Vi{k) in terms of the Ui{k): 

n 
Vn{k) =^^{UjVn)u-j{k) (19) 

This allows us to express the expansion of Q{s, t) of Eq. P^ as 

t t 

a=l 6=1 
t t a b 

"" XI XUI 51 "'»'' ^"*"«^ {ujVb)u,{r)uj (s) 

a=l 6=1 i=l j = l 
t i t t 

= ^ ^ ^ ^ aab (uiUa) {ujVb)6{a - i)6'(6 - j)u^{r)uj (s) 

a=l 6=1 i=l j = l 
t t 

j=i j=i 

where 9{x) = 1 if a; > 1 and zero otherwise, and 

t t 
Pi] = E E ""'' {uiVa){ujVb)0{a - i)9{b - j) (21) 

a=l 6=1 

The coefficients /J^ can be interpreted in terms of generalized correlation coefficients as follows. We have, from 
Eq. (PO)) and the fact that (uiUj) = Sij, the relationship 

^^7/,(r)u6(s)gW(r,s)Pe(r)Pe(s) = Pab (22) 

r s 

We now introduce a generalized a*''-order ANND knn{k) as 

t 

fcin^(fc) = E"-(^')^(^'l^) = E"«(^')Q**^(^''^)^^(^') = E/5-^-^^W (23) 

fc' fc' j=l 

where we have used Eqs.([TT1 [20|) and {ui{k)) = 0. This allows us to write Q^^^r, s) of Eq. ([20]) as 

t t 

Q^'Hr,s) = ^«,(r)fcW(.s) = Y. ^^nlirhd^) (24) 

i=l 1=1 

and also allows us to express Eq. (P^ as 

/3a6 = (fci"j(fc)^i6(fc)) (25) 

We can then use this to introduce generalized correlation coefficients Vab to order t: 

((fcin - (fcin))(Wb - (Ub))) {kiViUb) Pab ,^„. 

rab = I = , = -5- (26) 

\J{{kii-{ki^^W)^{{Ub-{ub)Y) V((fct^)2) ^° 



where a, b — 1,2, ... ,t, we have used {knn) = = (u^), and we have defined 



Ra = sjiiki-^r) 












The definition of Tab in Eq. (j26p satisfies — 1 < raj, < 1, which follows from the Cauchy-Schwarz inequality 

\{{X {X)){Y (r)))|^ < {{X {X)f){{Y - {Y)f) 
for two variables X and Y . We can then write the expansion of (3'*''(r, s) of Eq. ((20)) as 

Q'^^Hr,s) ^^^RarabUa{r)Ub{s) 
a=l b=l 



(27) 



(28) 



(29) 



with the condition Rafab — Rbfba coming from (iab — Pba- Note that, to a given order t, the Tab are not independent, 



since by Eq. (^5)) we have r^^ + r^j 
parameterize the rab as 



rak 



1. This allows us to introduce t — 1 hyperspherical angles 9ak and 



^a(t-l) 
rat 



1 



( Pal \ ( 

Pa2 
Pa3 



Pak 



COsOal 

Sm0alCOs9a2 

sin0alS[li0a2COs9a3 

{U'^~lsm9a^)cOSeak 



^a(t-l) I sin0al---Sm0a{t^2)COS0a{t_i) 

\ fiat I y sin6lQi---sin6'a(4_2)Sin6lQ(t_i) y 



(30) 



This form of the Tab, as appearing in the expansion of Eq. (|29p. makes it explicit that it is the combination of the fiab 
parameters that appear in Ra that sets the scale for Q^^^{r, s). 

It is straightforward to see how the usual ANND fc„„(fc), and related Newman correlation coefficient r, are related 

to the generalized knn{k) of Eq. ([23)) and Vab of Eq. (l26l) . Suppose we keep only the leading-order t = 1 term in the 
expansion of Q'^'^^r, s) in Eq. (|T5)) : 



where, from Eq. (|T5)) . 



^^i(fc) 



'■^\r,s) = aiivi{r)vi{s) 
k-{k) k- (k) 



(31) 



(32) 



VoTT ^(fc2) - (fc)2 

In this case only the orthonormal basis vector ui(fc) = ui(fc) of Eq. ()17p would arise, leading to the expansion 

Q(i)(r,s)=/3nui(r)ui(s) (33) 

The only generalized ANND function of Eq. (|25)) that would appear would be 



ki'}ik) = Y,u,ik')Pik'\k) ^ ^Y.ik' - {k'))p{k'\k) = -1 



/CTll 



^/c'P(fc'|fc)-(/c) 



Since the standard ANND /c„„(fc) is defined as 



fcnn(fc)=^fc'-P(fc'|fc) 



(34) 



(35) 



which satisfies (fc„„(fc)) = (fc), we then have the relationship 






(36) 



We can also relate the standard Newman correlation factor r to the corresponding generalization of Eq. ((26)) . The 
Newman factor r can be written as [3] 



^p) ! ^fc)2 E ^'fc [^(fc'' fc) - ^eCfcO/'elfc)] 



1 
CTll 

1 

CTll 



^k'kP{k'\k)Pe{k) - {kf 



k\k 



1 
CTll 



^fc„„(fc)fcPe(fc)-(fc)^ 



((fc„„(fc)-(fc„„(fc)))(fc-(fc))) 



(37) 



where we have used Eq. (|4]) to relate the joint probability to the conditional probability, as well as {k„n{k)) — {k). 
Using Eq. (I55|) to relate knn{k) to knn{k), as well as Eq. (15^ to relate k to ui(fc), we then have 



— (V^)' (fcil^(fc)«i(fc)> = (fcil^(fc)«i(fc)> 



CTll 



The corresponding correlation coefficient of Eq. (j26p that would appear to this order is 



rii 



{{kL'} ^ {ki\l)){u, - (u,))) 



{f^nnUi 



^{{k^:i-{k^iw)^{{u,-{u{)Y) ^((fcl^ 1^1 



(38) 



(39) 



which allows us to identify r = Pn = rii|/3ii|. Note that, to this order, we have rn = ±1, which simply reflects the 
fact that in the linear approximation there is perfect linear correlation or anti-correlation between fc„„ and ui. 



III. SPECIFYING THE AVERAGE NEAREST NEIGHBOUR DEGREE 

An alternate, but related, approach to the expansion discussed here has been formulated by Weber and Porto [?], 
who examined how one could expand the joint probability in terms of a particular functional form for the ANND 
knn{k). In the current notation, this approach proceeds as follows. One begins with the functional form of Eq. (jlip 
for the conditional probability: 



subject to the constraint of Eq. (|12p : 



P{k'\k) = P,{k')[l + Q{k' ,k)] 



Y,Qik',k)Peik')=0 



(40) 



(41) 



One then assumes the symmetric function Q{k' , k) has the form 

Q{k',k) ^h{k')h{k) 
and expresses the ANND fc„„(fc) in terms of h{k) as 

k„n{k) = Y^ k'P{k'\k) = {k)+Y^ k'h{k')Pe{k')h{k) = (k) + {kh{k))h{k) 



(42) 



(43) 



which allows one to infer 



From this, one has 



h{k) = 



'^nn\k) \k) 

{kh{k)) 



{kh{k)) 



(fcfc„„(fc) - fc(fc)) 
{kh{k)) 



(khik)) ^ J {kknnik)) - {ky 



(44) 



(45) 



The function Q{k' , k) arising in the joint probabihty 

is thus specified by fc„„(fc). In order to respect the identity {knn{k)) — (fc), we can parameterize knn{k) as 

~knn{k) = J^gik) (47) 

for a suitable function g{k). 

This approach can be related to the expansion considered here. To do this, we first cast the expansion of Q{k' , fc) 
in Eq. (|^ as 

^^,^^^fc||(fcOfciM (48) 

(fciiHfc)^ii(fc)) 



and the parameterization of fc„„(fc) of Eq. (|47p as 



knnW — I — 



/CTll 



3(fc) _^ 



(.9(fc)> 



where Eqs.(|32l IM)) have been used. This is to be compared to the general expansion of Eq. ([24 



(49) 



QW(fc',fc) = ^^A,u.(fc')^,(fc) -5]w.(fc')fci^(fc) = E^™(^')"^W (50) 

i— 1 J — 1 2—1 i— 1 

Rather than considering the /3ij coefficients as input parameters to (5*^*)(fc',fc), we could, in analogy with Eq. (|47p . 
specify the ANND fcmi (fc) by some function /^"^(fc). In order to respect the normalization (fcmi (fc)) = 0, we write 

fcin^fc) =[,/'"' (fc)-(/^''Hfc))] (51) 

Introducing 



a/. = V^((/('>)(fc)-(/('^)(fc)))2) (52) 

we can write this as 

e'W^.,- ^"'^'-'^"'""' .^/-.'-'^ (53) 

SO that g^"'\k) has zero mean and unit norm. We then expand g^"'\k) in terms of the orthonormal basis Ma(fc) to 
some order t: 



g^''\k) =Y^c^:\,{k) (54) 



&=i 



with the coefficients o^ given by c^° = {ui{k)g'^°'\k)). Comparing this to the expansion of Eq. (l23l) of fcnn (fc) in 
terms of the /3ij allows us to identify 

Pab = ^r4") = a/o(^,(fc)g('^)(fc)) (55) 

In this approach, the order t to which one is working should be chosen so that the expansion in Eq. ([M)) of ^'^"-'(fc) in 
terms of the orthonormal basis Ui{k) is accurate. 

Note that, in analogy with the expansion of Q(fc', fc) of Eq. ((48]) developed by Weber and Porto Q, we could consider 
in the present approach a generalization of the expansion of Eq. (j24p : 

g(*)(fc', fc) = ^ k^i{k')um ^ E fiS^!^'"^!'^ (56) 

i=l i=l {knn(k)Ui[k)) 

As with any perturbative expansion, the potential advantages of this form over the expression of Eq. (j24p will depend 
on these higher-order terms being small, in some sense, to the unperturbed case. 



IV. CLUSTERING COEFFICIENTS 

Clustering measures correlations among 3 nodes in a network, and so requires knowledge of the conditional prob- 
ability P(fc',fc"|A:), which is the probability that a vertex of degree k is simultaneously connected to two vertices of 
degree k' and k" Q. For non-Markovian networks P(fc', k"\k) and P{k'\k) are unrelated, but for Markovian networks, 
we have the relation P(fc', k''\k) = P{k'\k)P{k"\k), so knowledge of the two-point probability distribution is sufficient. 
As shown by Dorogovtsev [10| , various measures of clustering in a network can be derived in this case taking into 
account degree correlations present in P{k'\k). Three related clustering coefficients can be introduced: 

• The degree-dependent local clustering coefficient C(fc): 

where (TO„„(fc)) is the average number of connections between the nearest neighbours of a vertex of degree k. 

• The mean clustering coefficient C: 

C-^P(fc)C(fc) (58) 



• The clustering coefficient C: 

T.kP{k){^nn{k)) 



c 



EfeP(fc)fc(fc-f)/2 fc((fc)-l) 



^^^Y.Kk-l)Pik)Cik) (59) 



As shown in Ref. [13] , assuming the size N of the network is large and there is "weak" clustering, the degree-dependent 
local clustering coefficient C(fc) of Eq. ([57)1 is given by 

C(^) = ]^ E P{q'\k)P{q\k)^^{q' - l){q - 1) (60) 

q,q' 

which, in terms of the expansion of P{k'\k) in terms of the function Q{k\ k) through Eq. pT|) . can be written as 

^(^) = 77T E ^^('?') [1 + Q('^'' ^)] P-^l) [1 + ^(9, k)] [I + Q{q', q)] {q' - l){q - 1) (61) 

iV k — ; 
q,q 

The mean clustering coefficient C and the clustering coefficient C then follow by Eqs.(|58 l[59| . Note that. In the limit 
of an uncorrelated network {Q{k' , k) — 0), we have 

N k 

Thus, a /c-dependence of C(fc), leading to differences between C(fc), C, and C, signals the presence of nearest- 
neighbour degree correlations. 

One can now use the expansion of Q(rn,n) of Eq. (|20p in order to express these clustering coefficients in terms of 
the expansion parameters /3ab- We first do so for the local coefficient C{k). When expanding Eq. (IFlT) . we see that 
there will be linear, quadratic, and cubic terms in Q{m, n) present. The linear terms can be evaluated using only the 
properties (ua) — and (uaUb) = Sab- However, for the quadratic and cubic terms, we will need to evaluate (uaUbUi). 
This can be done as follows. Let us express the relations of Eqs. p^fTO)) relating the basis vectors Ua(k) and Va{k) as 

m 

J = l 

n 

Vn{k) = '^dn;jU.j{k) (63) 

i=i 
We then have 

a a 

Ua{k)ui{k) = '^CajVj{k)ui{k) = '^CajVj{k)vi{k) (64) 



where we have used ui{k) — ui(fc). Using the definition of u„(fc) in Eq. (rT6|) . we have 

1 



VjVi = 



y/aJJa]J 
1 



[F - (F)] [k - (fc)] 



(65) 



In this we can then use the second relation of Eq. (j63p in this equation to express the terms involving Vi in terms of 
Ui, and then insert this into Eg. (1641) in order to find UaUi in terms of a series of terms linear in Ui. We find 



ia{k)ui(k) = ^ 



L-aj 



1 ^/^IWri 



J+1 



n/^V+IJ-kT 2_^ dj+i^rnUm{k) 



/a~{k) ^ djraUyn{k) - yCTir(F)Mi(fc) + (fc 



i+i\ 



(F)(fc) 



(66) 



We then have 



(UaUbUi) = ^ 



L-aj 



Jab 



t V^Tj^TI 



[V'^j+ij+i^i+i.b ^ \/^{k)djb - ^/^{k^)Sbl 



Introducing the notation 



{f{q,k)), = Y,Peiq)fiq,k) 



(67) 



(68) 



for a function f{q, k), one can then derive the following relations: 

(Q(g,fc)zii(<7)), = fcW(fc) 
{Qiq,k)ki^2iq)), = Y.f^-bk^nlik) 

b 
{Q{q',q)Q{q',k)),, =Y.ktr!{q)kiCr!{k) 

a 

{Qiq',q)Qiq',k)m{q')),, =J27abki^2iq)ki^^ik) 

{Qiq,k)ki^2iq)Mq)), = E/3-^''-^-(^) 

b,c 

Using ui{k) = ^(Tiiwi(fc) + ((fc) — 1), we then find the following contributions to C(k): 
J2Peiq)Peiq'){q-l)iq'-l) = m-l]' 

J2 Pe{q)Pe{q) [Q{q, k) + Q{q' , k) + Q(g,q')] (<Z - l)(<z' - 1) = ^ii/3ii + 2V^((fc) - l)fcW(fc) 
^ Pe{q)Pe{q) [Q{q, k)Q{q' , k) + Q(g', fc)Q(g, g') + Q{q, k)Q{q, q')] {q - l){q' - 1) = 



(69) 



9>9 

= cm 



[fcW (fc)] ' + 2V^((fc) - 1) ^ /3i„fct) (fc) + 2an ^ /^i.Tafefci^j (fc) 

a a.b 



Y,Pe{q)Pe{q')Q{q,q')Q{q,k)Q{q',k){q^l){q' - 1) = du ^ (3,d7aclbdkiil{k)ki'l{k) 



q,q' 



a^b^c^d 



+ 2V^((fc) - 1) ^ /3,,7befct)(fc)fcW(fc) + ((fc) - l)'^/3a.fct'(fc)fci'2(fc) 



(70) 



a,6,c 



ab 
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We then find the local clustering coefficient C{k), normalized to the non-correlated value Cnc of Eq. (|62|) . can be 
written as 



tP = 1 + ^ + E Gakiiiik) + Y, H,,kiiiik)ki'^ik) 



where 



a 



aiifi- 



(71) 



m - 1? 




^•^ - ((fc) % [^^" 1^1 J 1 (^^^ ,)2E/^i^^« 




<^1 



31b- 



2J Pcdlaclbd 



(72) 



To calculate the clustering coefficient C of Eq. (15^1) , we will need 



(fct)(fc)fci'2(fc)«i(fc)>fc =Y.P--P^dlcci (73) 



This leads to 



^ = 1 + J^ + 77|(^ E ^-^"1 + E ^""/^-/^b- + 77|(^ E HabPacPbdlcd (74) 

"^ ^^"' -^ a a,b,c ^^ ' ' a,b,c,d 

In order to get a qualitative sense of the behaviour of C(fc) of Eq. ([TT]) and C of Eq. (|74|). let us assume that the 
expansion parameters jSab are small, such that we need only keep terms up to linear order in them. Since by the 
definition of Eq. (f23|) the generalized ANND fc„„(fc) is linear in Pab, we find the local clustering coefficient C(fc) of 
Eq. ([7T|) could be approximated as 

a. ^'+((fc)-l)2 + ((fc)-l)'=""^^^ ^^'^ 

while the corresponding approximation of the clustering coefficient C of Eq. (fM)) is 

C 3aii^ii 

In the next section we consider a couple of examples to illustrate the behaviour of various quantities discussed in this 
and the previous sections. The full numerical results for the clustering coefficients suggest that, for for small /3ab, 
this linear approximation just considered gives at least give a fairly good qualitative picture for how the clustering 
coefficients behave. In particular, from Eq. (pT|) . the local degree-dependent clustering coefficient C(fc) has a k- 

dependence that follows approximately that of the first-order ANND fc„„(fc) (compare Fig. |4]and Fig. [5l and also 
Fig. |5] and Fig. ^. As well, the clustering coefficient C of Eq. ((75)) is increased/decreased slightly, relative to the 
non-correlated value, for positive/negative /3ii, corresponding to assortative/disassortative mixing (see the discussion 
immediately following, respectively. Fig. [5] and Fig. [S]). 

V. EXAMPLES 

In order to illustrate the relative size of the effects considered in the previous sections, in this Section we examine 
two examples of explicit choices of the parameters /3ab in the expansion of Q{k' , k) of Eq. pT|) . In keeping with the 
spirit of a perturbative approach, we expect that, if we keep relatively few terms in the expansion, the corrections to 
the uncorrelated case Q{k',k) = will be small in some sense. In both cases we use a degree distribution function 
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P{k) - fcT, with 7 = 
appears as in Fig. [1] 



-1.2, and consider degrees 1 < fc < 40. The uncorrelated joint probabihty Pnc{q\k) — Pe{q) 



P.M I k) 




0.02 
40 







FIG. 1. The uncorrelated conditional probability Pnc{q\k) for P{k) ~ fc^, with 7 = —1.2. 



The first case will be one exhibiting assortative mixing. For this we choose a single linear term /3ii ~ 0.07. The 
function Q{q, k) appears in Fig. [51 while the conditional probability P{q\k) — P^{q)[\ + Q{q, fc)] appears in Fig. [3] 



Q(q , k) 








FIG. 2. The expansion function Q{q,k) for /3ii = 0.07. 
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40 



FIG. 3. The conditional probability P{q\k) for /?ii = 0.07. 



As expected with assortative mixing, we see from the conditional probability of Fig. [3] that there is an increased 
likelihood of nodes with the same degree to be connected. Fig. S] shows the generalized ANND of Eg . ([25)) : in this 
case, only fc„„ (fc) will be non-zero, and it is a linear function. 



0.15 



0.05 




-0.05 



-0.15 



FIG. 4. The first-order ANND fcii^(fc) for /3ii = 0.07 



The only non-zero correlation coefficient Vab of Eq. ()26|1 is rn = 1, as expected, since the linear correlation is perfect 
in this case. Finally, in Fig. [S] we plot the degree-dependent local clustering coefficient C{k) of Eq. (I57)l . which has 
been normalized to the uncorrelated value Cnc of Eq. ([S^ . 



13 



1.25 



1.15 



1.05 




0.95 



0.85 



FIG. 5. The degree-dependent local clustering coefficient C{k)/Cnc for /3ii = 0.07, normalized to the uncorrelated value. 



Corresponding to this C(fc), we find the mean clustering coefficient C of Eq. (|55|) . normalized to C„c, to be 
C/Cnc = 0.94, while the clustering coefficient C of Eq. (jSH)) . also normalized to Cnc, to be C/Cnc = 1-10; this 
represents about a 10% change from the uncorrelated state. 

The second case we examine will be one exhibiting disassortative mixing. For this we consider including both linear 
and quadratic terms in the expansion of Q{q, k), and we choose fSu — —0.07, /3i2 ~ 0.02 — /32i, and /322 = —0.04. The 
function Q{q, k) appears in Fig. |6l while the conditional probability P{q\k) — Pe{q)[l + Q{q, k)] appears in Fig. [T] 
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FIG. 6. The expansion function Q{q,k) for /3ii 



-0.07, /3i2 = 0.02 = 1321, and ^22 = -0.04. 
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FIG. 7. The conditional probability P(g[fc) for Pn = -0.07, /3i2 = 0.02 = /32i, and ^22 = -0.04. 



As expected with disassortative mixing, in this case we see from the conditional probability of Fig. [7] that there is 
an increased likelihood of nodes of different degree to be connected. Fig. [5] shows the generalized ANND of Eg.ip^: 
in this case, both fc„„(fc) and fc„„(fc) will be non-zero. 
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FIG. 8. The first-order ANND k^-alik) and second-order ANND fci^J(fc) for /3ii = -0.07, /3i2 = 0.02 = /32i, and ^2 



-0.04. 



It is found that the only non-zero correlation coefficients Vah of Eq. ([26|) are r\\ — —0.96, r\2 — 0.27, r2i — 0.45, 
and Til = —0.89. Finally, in Fig. IHlwe plot the degree-dependent local clustering coefficient C(fc) of Eq. ([57]) . which 
has been normalized to the uncorrelated value Cnc of Eq. (|62l) . 
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FIG. 9. The degree-dependent local clustering coefficient C{k)/Cnc for j3\\ 
normalized to the uncorrelated value. 



-0.07, /?i2 = 0.02 = /321, and /322 



-0.04, 



Corresponding to this C(/c), we find the mean clustering coefficient C of Eq. (|58|) . normalized to Cnc, to be 
C /Cnc — 1.08, while the clustering coefficient C of Eq. (f59|) . also normalized to Cnc: to be C/Cnc — 0.91; this 
represents about a 10% change from tlie uncorrelated state. 



VI. CONCLUSIONS 



We have examined an expansion of the conditional probability P{k' , k) = Pe{k')[l + Q{k', k)] about the uncorrelated 
case Pncik'\k) = Pe{k') in terms of symmetric polynomials in k' and k. Setting aside the question of convergence, we 
find a systematic expansion is possible, and will involve expansion coefficients /3a&- Having specified these coefficients 
up to a certain order, the usual measures of nearest-neighbour degree correlations - the Average Nearest Neighbour 
Degree ANND, Pearson-inspired correlation coefficients, and various clustering coefficients - can be calculated. In 
the present case, since non-linear terms in the expansion would in principle appear, appropriate generalizations of 
these measures of degree correlations were introduced. 

One possible use for the type of expansion discussed in this paper might be as a means to estimate qualitatively 
the effect of nearest-neighbour degree correlations in models describing the evolution of states of nodes on specific 
networks - for example, the propagation of disease on the network. This can be formulated in terms of the equations 
governing the evolution of probabilities.. This can come by formulating the equations governing the evolution in terms 
of probabilities. To see this in a general sense, let px,k be the probability that a node of degree fc is in a state X at 
time t. A differential equation describing the time evolution of px,k might then contain terms such as 



dpx,i 
dt 



9PX,kPY,k 



(77) 



which would describe how the transition X —> Y in the system, parameterized by a rate g, affects the probability 
Px,k- Nearest-neighbour degree correlations could be incorporated into this model by consideration of the interaction 
term [H 



dpx.k 
dt 



gkpx,kQ>Y,k 



where 



Q^.fe = E ^^-T^-P{k'\k)py,^ 



(78) 
(79) 
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is the probability that a neighbour of the node, chosen randomly from amongst its k neighbours, is in a state Y. 
Thus, specifying a conditional probability function P{k'\k) that differs from the uncorrelated case Pnc{k'\k) = Pe{k') 
would allow one to see the effects of different types of degree correlations in this model. 

If one is to use the expansion of Q{k',k) of Eq. (|20l) in specifying a conditional probability P{k'\k) containing 
correlations, one must decide on the values of the expansion parameters /3afc to use, and at what point does one know 
that enough terms have been kept. A mild constraint on the expansion is that, being a perturbative expansion, the 
corrections about the unperturbed case should be small, and that inclusion of higher-order effects should not affect 
significantly the results of the presumably more important lower-order terms. An equivalent statement of this is that, 
for a given set of parameters, changing them slightly would not change the overall qualitative picture. It may be 
possible, by examining classes of real networks, to be able to say something about the relative magnitude of the various 
/3a6 coefficients. However, if one wanted just a qualitative estimate of the relative effects of such degree correlations 
in the model under consideration, then one could consider a "small" number of terms in this expansion with "small" 
values of the Pab parameters, where "small" in this context is defined through the constraint that the effects do not 
significantly alter the uncorrelated case. The examples of Section |V] show that a reasonable set of parameters can be 
chosen which incorporate various types of degree correlations. Such an approach would not allow one to say anything 
quantitative about a real network, but it would allow one to decide, with some degree of confidence, whether or not 
inclusion of degree correlations in the model might lead to significant effects, and thus would be worthy of further, 
more detailed, study. 
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